NSF PAR Search | NSF Public Access Repository


SEESys: Online Pose Error Estimation System for Visual SLAM

https://doi.org/10.1145/3666025.3699341

Hu, Tianyi; Scargill, Tim; Yang, Fan; Chen, Ying; Lan, Guohao; Gorlatova, Maria (November 2024, ACM)

Full Text Available
EyeSyn: Psychology-inspired Eye Movement Synthesis for Gaze-based Activity Recognition

https://doi.org/10.1109/IPSN54338.2022.00026

Lan, Guohao; Scargill, Tim; Gorlatova, Maria (May 2022, ACM/IEEE IPSN)

Recent advances in eye tracking have given birth to a new genre of gaze-based context sensing applications, ranging from cognitive load estimation to emotion recognition. To achieve state-of-the-art recognition accuracy, a large-scale, labeled eye movement dataset is needed to train deep learning-based classifiers. However, due to the heterogeneity in human visual behavior, as well as the labor-intensive and privacy-compromising data collection process, datasets for gaze-based activity recognition are scarce and hard to collect. To alleviate the sparse gaze data problem, we present EyeSyn, a novel suite of psychology-inspired generative models that leverages only publicly available images and videos to synthesize a realistic and arbitrarily large eye movement dataset. Taking gaze-based museum activity recognition as a case study, our evaluation demonstrates that EyeSyn can not only replicate the distinct patterns in the actual gaze signals that are captured by an eye tracking device, but also simulate the signal diversity that results from different measurement setups and subject heterogeneity. Moreover, in the few-shot learning scenario, EyeSyn can be readily incorporated with either transfer learning or meta-learning to achieve 90% accuracy, without the need for a large-scale dataset for training.
Full Text Available
GazeGraph: graph-based few-shot cognitive context sensing from human visual behavior

https://doi.org/10.1145/3384419.3430774

Lan, Guohao; Heit, Bailey; Scargill, Tim; Gorlatova, Maria (November 2020, Proceedings of the 18th Conference on Embedded Networked Sensor Systems)
In this work, we present GazeGraph, a system that leverages human gazes as the sensing modality for cognitive context sensing. GazeGraph is a generalized framework that is compatible with different eye trackers and supports various gaze-based sensing applications. It ensures high sensing performance in the presence of heterogeneity of human visual behavior, and enables quick system adaptation to unseen sensing scenarios with few-shot instances. To achieve these capabilities, we introduce spatial-temporal gaze graphs and a deep learning-based representation learning method to extract powerful and generalized features from eye movements for context sensing. Furthermore, we develop a few-shot gaze graph learning module that adapts the "learning to learn" concept from meta-learning to enable quick system adaptation in a data-efficient manner. Our evaluation demonstrates that GazeGraph outperforms the existing solutions in recognition accuracy by 45% on average over three datasets. Moreover, in few-shot learning scenarios, GazeGraph outperforms the transfer learning-based approach by 19% to 30%, while reducing the system adaptation time by 80%.
Full Text Available
CollabAR: Edge-assisted Collaborative Image Recognition for Mobile Augmented Reality

https://doi.org/10.1109/IPSN48710.2020.00-26

Liu, Zida; Lan, Guohao; Stojkovic, Jovan; Zhang, Yunfan; Joe-Wong, Carlee; Gorlatova, Maria (April 2020, ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN))

Mobile Augmented Reality (AR), which overlays digital content on the real-world scenes surrounding a user, is bringing immersive interactive experiences where the real and virtual worlds are tightly coupled. To enable seamless and precise AR experiences, an image recognition system that can accurately recognize the object in the camera view with low system latency is required. However, due to the pervasiveness and severity of image distortions, an effective and robust image recognition solution for mobile AR is still elusive. In this paper, we present CollabAR, an edge-assisted system that provides distortion-tolerant image recognition for mobile AR with imperceptible system latency. CollabAR incorporates both distortion-tolerant and collaborative image recognition modules in its design. The former enables distortion-adaptive image recognition to improve the robustness against image distortions, while the latter exploits the "spatial-temporal" correlation among mobile AR users to improve recognition accuracy. We implement CollabAR on four different commodity devices, and evaluate its performance on two multi-view image datasets. Our evaluation demonstrates that CollabAR achieves over 96% recognition accuracy for images with severe distortions, while reducing the end-to-end system latency to as low as 17.8 ms for commodity mobile devices.
Full Text Available
Wireless Sensing Using Dynamic Metasurface Antennas: Challenges and Opportunities

https://doi.org/10.1109/MCOM.001.1900696

Lan, Guohao; Imani, Mohammadreza F.; Hougne, Philipp del; Hu, Wenjun; Smith, David R.; Gorlatova, Maria (June 2020, IEEE Communications Magazine)

Full Text Available
Invited Paper: Edge-based Provisioning of Holographic Content for Contextual and Personalized Augmented Reality

https://doi.org/10.1109/PerComWorkshops48775.2020.9156256

Glushakov, Michael; Zhang, Yunfan; Han, Yuqi; Scargill, Timothy James; Lan, Guohao; Gorlatova, Maria (March 2020, IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops))

Mobile augmented reality (AR) has been attracting considerable attention from industry and academia due to its potential to provide vibrant immersive experiences that seamlessly blend physical and virtual worlds. In this paper we focus on creating contextual and personalized AR experiences via edge-based on-demand provisioning of holographic content most appropriate for the conditions and/or most matching user interests. We present edge-based hologram provisioning and pre-provisioning frameworks we developed for Google ARCore and Magic Leap One AR experiences, and describe open challenges and research directions associated with this approach to holographic content storage and transfer. The code we have developed for this paper is available online.
Full Text Available
